$EVA^2$ : Exploiting Temporal Redundancy in Live Computer Vision
نویسندگان
چکیده
Hardware support for deep convolutional neural networks (CNNs) is critical to advanced computer vision in mobile and embedded devices. Current designs, however, accelerate generic CNNs; they do not exploit the unique characteristics of real-time vision. We propose to use the temporal redundancy in natural video to avoid unnecessary computation on most frames. A new algorithm, activation motion compensation, detects changes in the visual input and incrementally updates a previously-computed output. The technique takes inspiration from video compression and applies well-known motion estimation techniques to adapt to visual changes. We use an adaptive key frame rate to control the trade-off between efficiency and vision quality as the input changes. We implement the technique in hardware as an extension to existing state-of-the-art CNN accelerator designs. The new unit reduces the average energy per frame by 54.2%, 61.7%, and 87.6% for three CNNs with less than 1% loss in vision accuracy.
منابع مشابه
Research Statement — Qifa Ke
My primary research interests are computer vision and machine learning, with an emphasis on video analysis and its applications. I believe a working computer vision system should exploit three most fundamental constraints existing in a video sequence: the geometric constraint among video frames and the threedimensional (3D) scene, the coherency in apparent image motions due to scene regularitie...
متن کاملExploiting the overlap between temporal redundancy and spatial redundancy in storage system
Recent years have seen ever increasing file systems and storage applications employ versioning or log-based techniques to protect data integrity and improve the overall system performance, in addition to traditional database systems. The basic idea behind these techniques is to add extra data redundancy before committing new updates on disks. More specifically, in database systems, the log reco...
متن کاملRobot Motion Vision Pait I: Theory
A direct method called fixation is introduced for solving the general motion vision problem, arbitrary motion relative to an arbitrary environment. This method results in a linear constraint equation which explicitly expresses the rotational velocity in terms of the translational velocity. The combination of this constraint equation with the Brightness-Change Constraint Equation solves the gene...
متن کاملA PRACTICAL APPROACH TO REAL-TIME DYNAMIC BACKGROUND GENERATION BASED ON A TEMPORAL MEDIAN FILTER
In many computer vision applications, segmenting and extraction of moving objects in video sequences is an essential task. Background subtraction, by which each input image is subtracted from the reference image, has often been used for this purpose. In this paper, we offer a novel background-subtraction technique for real-time dynamic background generation using color images that are taken fro...
متن کاملA scheme for spatial scalability using nonscalable encoders
We describe a scheme that achieves spatially scalable coding of video by employing nonscalable video encoders (e.g., MPEG-2 main profile), along with a downsampler and an upsampler. The scheme is illustrated for the case of coding video at two resolutions. The enhancement layer is coded in two steps by first exploiting the spatial redundancy and then exploiting the temporal redundancy. Hence, t...
متن کامل